CDS

Accession Number TCMCG019C13117
gbkey CDS
Protein Id XP_022941909.1
Location complement(join(4220509..4220513,4220662..4221160,4221590..4222504))
Gene LOC111447126
GeneID 111447126
Organism Cucurbita moschata
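
The Location field above follows the INSDC feature-location style: `complement(...)` marks the reverse strand and `join(...)` lists the exon spans. As a minimal sketch, assuming the string contains only `complement`, `join`, and plain `start..end` spans (no partial-end markers such as `<` or `>`), the CDS length can be recovered as follows. The helper name `parse_location` is hypothetical and not part of any library.

```python
import re

def parse_location(location):
    """Return (strand, [(start, end), ...]) for a simple INSDC-style location.

    Assumption: only complement(), join() and plain start..end spans occur.
    """
    strand = "-" if location.startswith("complement(") else "+"
    # Every span is written as "start..end" with 1-based inclusive coordinates.
    spans = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, spans

location = "complement(join(4220509..4220513,4220662..4221160,4221590..4222504))"
strand, spans = parse_location(location)

# Inclusive coordinates, so each span contributes end - start + 1 nucleotides.
cds_length = sum(end - start + 1 for start, end in spans)
print(strand, cds_length)
```

The three spans contribute 5, 499, and 915 nucleotides, giving 1419 nt in total, or 473 codons. That matches the 472 aa protein length reported in this record plus one stop codon.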

Protein

Length 472 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA418582
db_source XM_023086141.1
Definition anthocyanidin 3-O-glucosyltransferase 5-like isoform X2 [Cucurbita moschata]

EGGNOG-MAPPER Annotation

COG_category CG
Description Belongs to the UDP-glycosyltransferase family
KEGG_TC -
KEGG_Module -
KEGG_Reaction R02594, R03605, R04005
KEGG_rclass RC00005, RC00171
BRITE ko00000, ko00001, ko01000, ko01003
KEGG_ko ko:K12356
EC 2.4.1.111
KEGG_Pathway ko00940, map00940
GOs -

Sequence

CDS:  
ATGGACTCCCCAACCCACGTCGCTCTCATCTCAAGCCCCGGGATGGGCCACCTCTTCCCCTCTCTCGAGCTCGCCACGCGACTCTCCACGCGCCACCACCTCACCCTCACTGTTTTCCTCGTCACCTCCCACTCCTCCTCCGCCGAAAATAACGTCGTTGCCGCCGCCGAGGCCACTGGCCTCTTTACTGTCGTCGAACTCCCACCGGCTGACATGTCCGACGTCACCGATTCCACTGTCGTTGGCCGCCTTGCCATCACCATGCGCCGCCACGTCCCGGCTCTCCGCTCGGCCATCTCTGCTCTCACCTCCCGCCCCTCCGCCCTCATTGCAGACATCTTCTCCACCGAGGCCTTTGCCGTCGCCGACGAGTTCCACATGGCCAAATACGTCTTCGTCGCCTCTAATGCATGGTTTTTAGCCTTGACCATTTACGCCCAGGTTCTCGACAAGCAAATCGTCGGGCAGTACGTGGACCAGAAAGAACCGCTTCAAATCCCTGGATGCGAACCGGTTCGTCCATGTGACGTCGTAGACCCGATGCTGGACCGGACCGAATCCCAGTATTACGAGTACGTCAAAATGGGGAGGGCAATAGCGTCGAGCCACGGCGTTTTGGTTAACTCGTGGGATGAGTTGCAAGGTCGCACACTCGCATCGTTCAAAGATCGGAGTCTGTTGGGTCGAGTAATGAACGCGCCGGTTTACTCGATCGGACCGATCGTGCGACATTTCGGCTCTGGGAAAGACGGCTCGAGCGAGCTGTTCAACTGGTTGAGGAAGCAGCCCGGGAAGTCGGTGATTTACGTGTCGTTCGGGAGCGGCGGAACGTTGTCGTTTGAGCAAATGACGGAAATGGCTCATGGCTTGGAGTTGAGTCGGCAGAGATTTGTTTGGGTGGTCCGGCCGCCCACGGTGAGGTCGGATGCGATGTTTTTCACGACAGGGGATGGGAGTGAGGACCAATCAGAGGCGAGATATTTGCCGGAGGGGTTTTTGGAGCGGACTAGCGAGGTGGGGTTTCTGGTGTCGATGTGGGCGGAGCAAACGGCGGTGCTGGGGAGTCCGGCAGTGGGGGGATTTTTCACGCACGGCGGATGGAACTCATCATTGGAAGGAATTACGAAGGGAGTTCCGATGATAGTGTGGCCGTTGTACGCGGAGCAGAGGATGAACGCCACGATGCTGGCGGATGAGATGGGGGTAGCGGTGCGGCCGAAGGAGCTGCCAGGGAATGCGGTGATCGGGAGGGAGGAGATCGCGGCGATGGTGAGGAAGATAATGGCGGAGGAGGACGAAGAAGGGAGAGCCATAAGAGCGAAGGCGATGGAACTTCAACGAAGTGCAGAAAAGGCCTGTGCGCAAGGAGGCTCGTCGTACGAGAACTTTGCTCGAGTTGTGAAACTTTTTGGCCGTTGA
Protein:  
MDSPTHVALISSPGMGHLFPSLELATRLSTRHHLTLTVFLVTSHSSSAENNVVAAAEATGLFTVVELPPADMSDVTDSTVVGRLAITMRRHVPALRSAISALTSRPSALIADIFSTEAFAVADEFHMAKYVFVASNAWFLALTIYAQVLDKQIVGQYVDQKEPLQIPGCEPVRPCDVVDPMLDRTESQYYEYVKMGRAIASSHGVLVNSWDELQGRTLASFKDRSLLGRVMNAPVYSIGPIVRHFGSGKDGSSELFNWLRKQPGKSVIYVSFGSGGTLSFEQMTEMAHGLELSRQRFVWVVRPPTVRSDAMFFTTGDGSEDQSEARYLPEGFLERTSEVGFLVSMWAEQTAVLGSPAVGGFFTHGGWNSSLEGITKGVPMIVWPLYAEQRMNATMLADEMGVAVRPKELPGNAVIGREEIAAMVRKIMAEEDEEGRAIRAKAMELQRSAEKACAQGGSSYENFARVVKLFGR
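
One way to check that the CDS and Protein sequences above agree is to translate the CDS with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code applies to this record and that the sequence contains only unambiguous A, C, G, T bases. The function name `translate` is illustrative, and only the first and last few codons of the CDS are used here to keep the example short.

```python
# Standard genetic code laid out in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

def translate(cds):
    """Translate a nucleotide CDS into protein; '*' marks a stop codon.

    Assumption: uppercase A/C/G/T only, length a multiple of three.
    """
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        b1, b2, b3 = cds[i:i + 3]
        index = 16 * BASES.index(b1) + 4 * BASES.index(b2) + BASES.index(b3)
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)

# First 21 and last 15 nucleotides of the CDS listed in this record.
cds_start = "ATGGACTCCCCAACCCACGTC"
cds_end = "CTTTTTGGCCGTTGA"

print(translate(cds_start))  # MDSPTHV
print(translate(cds_end))    # LFGR*
```

Applied to the full CDS, `translate(cds).rstrip("*")` can then be compared directly with the Protein string.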